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ABSTRACT 

We determine the angular power spectrum Ci of the Edinburgh/Durham Southern Galaxy Catalog 
(EDSGC) and use this statistic to constrain cosmological parameters. Our methods for determining 
Ci and the parameters that affect it are based on those developed for the analysis of cosmic microwave 
background maps. We expect them to be useful for future surveys. Assuming flat cold dark matter models 
with a cosmological constant (constrained by COBE/DMR and local cluster abundances) and a scale- 
independent bias b, we find acceptable fits to the EDSGC angular power spectrum with l.lf < b < 2.35 
and 0.2 < < 0.55 at 95% confidence. These results are not significantly affected by the "integral 
constraint" or extinction by interstellar dust, but may be by our assumption of Gaussianity. 

Subject headings: cosmology: theory - cosmology: observation ~ cosmology: galaxies 



1. INTRODUCTION 

Over the next decade, the quantity and quality of 
' galaxy survey data will improve greatly because of a 
, variety of new survey projects underway including the 
■ Sloan Digital Sky Survey (SDSS; see York et al. 2000). 

However, most of the galaxies in such surveys will not 
. have spectroscopically-determined redshifts; therefore, the 
' study of their angular correlations will be highly profitable 
] for our understanding of the large-scale structure of the 
. Universe. 

' The primary purpose of this paper is to consider an 
analysis approach which is likely to be useful for deriv- 
ing cosmological constraints from these larger surveys. 

, In particular, we use methods that have become stan- 
dard in the analysis of cosmic microwave background 
(CMB) anisotropy maps, such as those from Boomerang 
(de Bernardis et al. 2000; Lange et al. 2000) and Maxima- 
I (Hanany et al. 2000, Balbi et al. 2000). 

Estimation of the two-point angular correlation func- 
tion w{9) from galaxy surveys without redshfit informa- 
tion has a long history. Early work (Peebles & Hauser 
1974; Groth & Peebles 1977) found the angular correla- 
tion function to vary as w{9) — 9^~^ with 7 — 1.77 and 
a break at scales larger than ~ 9h^^ Mpc. The advent of 
automated surveys, such as the APM galaxy survey (Mad- 
dox et. al. 1990) and Edinburgh/Durham Southern Galaxy 
Catalogue (EDSGC; Colhns, Nichol & Lumsden 1992) en- 
abled a much more accurate determination of w{0), as each 
survey contained angular positions for over a million galax- 
ies. 

One way to compare the measured angular correla- 
tion function with theoretical predictions is to invert w{0) 
1 
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to obtain the three-dimensional power spectrum P{k). 
This requires inverting Limber's equation (Limber 1953). 
Baugh & Efstathiou (1993, 1994) and Gaztaiiaga & Baugh 
(1998) used Lucy's algorithm (Lucy 1974) to do the inver- 
sion, while Dodelson & Gaztaiiaga (2000) used a Bayesian 
prior constraining the smoothness of the power spectrum. 
Eisenstein & Zaldarriaga (2000) used a technique based on 
singular- value decomposition to get P{k) from 'w{6). They 
point out that, once the correlations in the inverted power 
spectra are included, the uncertainties on cosmological pa- 
rameters from the APM are significantly weakened. 

Our analysis is a three-step process, similar to what is 
done with CMB data sets (Tegmark 1997; Bond, Jaffe & 
Knox 1998; Bond, Jaffe & Knox 2000). The first step is the 
construction of a pixelized map of galaxy counts, together 
with its noise properties. The second step is the deter- 
mination of the angular power spectrum C; of the map 
using likelihood analysis, together with window functions 
and a covariance matrix. In the final step, we compare 
our observationally-determined C/ to the C/ predicted for 
a given set of parameters in order to get constraints on 
those parameters. We assume that the errors in Ci are 
log-normally distributed. 

The angular power spectrum Ci is a useful intermedi- 
ate step on this road from galaxy catalog to parameter 
constraints. Estimates of the angular power spectrum, 
together with a description of the uncertainties, can be 
viewed as a form of data compression. One has converted 
the ^ 1 million EDSGC galaxies (for example) into a hand- 
ful of power spectrum constraints, together with window 
functions and covariance matrices. Thus if one wishes to 
make other assumptions about bias and cosmological pa- 
rameters than we have done here and determine the re- 
sulting constraints, one can do so without having to return 
to the cumbersome galaxy catalog. 
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We use C; instead of its historically-preferred Legendre 
transform w{9) for several reasons. First, the error matrix 
structure is much simpler: {6CiSCi>) is band-diagonal and 
becomes diagonal in the limit of full-sky coverage, whereas 
{Sw{9)Sw{9')) is much more complicated and does not be- 
come diagonal even in the full-sky limit. Second, the rela- 
tion between C/ and the corresponding three dimensional 
statistic P{k) is simpler than that between and P{k) 
(or its Fourier transform ^(r) (Baugh & Efstathiou 1994)). 

We use likelihood analysis to determine C'l because the 
likelihood is a fundamental statistical quantity. The like- 
lihood is the probability of the data given Ci, which by 
Bayes' theorem is proportional to the probability of C; 
given the data. Another advantage of likelihood analysis 
is that, as explained below, it allows for straightforward 
control of systematic errors (due to, e.g., masking) via 
modifications of the noise matrix. 

Only on sufficiently large scales do we expect the like- 
lihood function to be a Gaussian that depends only on 
C; and not any higher-order correlations. We therefore 
restrict our analysis to I values less than some critical 
value. On small scales the likelihood function becomes 
much more complicated and its form harder to predict 
a priori. Mode-mode coupling due to nonlinear evolu- 
tion leads to departures of the Ci covariance matrix from 
band-diagonal. Therefore some of the advantages of like- 
lihood analysis and the angular power spectrum are lost 
on smaller scales where other techniques may be superior. 
The Gaussianity assumption is perhaps the weakest point 
of the approach outlined here. Below, we briefly discuss 
how the analysis can be improved in this regard with fu- 
ture data sets. 

The EDSGC, with over a million galaxies and cover- 
ing over 1000 sq. degrees, offers us an excellent test bed 
for applying our algorithms (Nichol, Collins & Lunisden 
2000). We convert this catalog into a pixelized map and 
determine its angular power spectrum together with win- 
dow functions and covariance matrix. As an illustrative 
application of the angular power spectrum, we constrain 
a scale-independent bias parameter b and the cosmologi- 
cal constant density parameter Cl\ in a COi3£^normalized 
ACDM model with zero mean spatial curvature. Our con- 
straints on the bias are improved by including constraints 
on the amplitude of the power spectrum derived from num- 
ber densities of low redshift massive clusters of galaxies 
(Viana & Liddle 1999, hereafter VL99; also see Pierpaoh, 
Scott & White 2000). These number densities are sensi- 
tive to the amplitude of the matter power spectrum cal- 
culated in linear perturbation theory, near the range of 
length scales probed by the EDSGC. 

The angular power spectrum of the APM catalog 
was previously estimated by Baugh & Efstathiou (1994) 
though not via likelihood analysis. Very recently, Efs- 
tathiou & Moody (2000) have applied the same techniques 
we do here to estimating C/ for the APM survey. Their 
approach differs from ours in how they constrain cosmo- 
logical parameters. Instead of projecting the theoretical 
three-dimensional power spectra P{k) into angular power 
spectra, they transform their Ci constraints into (highly 
correlated) constraints on P{k) and then compare to the- 
oretical P{k). 

We expect the analysis methods presented here to be 



useful for other current and future data sets — even those 
with large numbers of measured redshifts. For example, 
the Sloan Digital Sky Survey (SDSS) will spectroscopically 
determine the redshifts of a million galaxies, but there 
will be about one hundred times as many galaxies in the 
photometric data, without spectroscopic redshifts. One 
can generalize the methods presented here to analyze sets 
of maps produced from galaxies in different photometric 
redshift slices. 

In §2, we review likelihood analysis and the use of the 
quadratic estimator to iteratively find the maximum of 
the likelihood function. In §3 we describe our calculation 
of P{k) and its projection to C/. In §4 we show how to 
compare the calculated C; to the measured C/ in order to 
determine parameters. In §5 we apply our methods to the 
EDSGC, and discuss some possible sources of systematic 
error in §6. This is followed by a discussion of our results 
in §7 and a brief conclusion in §8. An appendix outlines 
the derivation of the projection of P{k) to C;. 

2. THE LIKELIHOOD FUNCTION AND QUADRATIC 
ESTIMATION 

The likelihood is a fundamental statistical quantity: the 
probability of the data given some theory. According to 
Bayes' theorem, the probability of the parameters of the 
assumed theory is proportional to the likelihood times any 
prior probability distribution we care to give the param- 
eters. Thus, determining the location of the likelihood 
maximum and understanding the behavior of the likeli- 
hood function in that neighborhood (i.e., understanding 
the uncertainties) is of great interest. 

Despite its fundamental importance, an exact likelihood 
analysis is not always possible. Two things can stand in 
our way: insufficient computer resources for evaluation of 
the likelihood function (operation count scales as N^-^ and 
memory use scales as N^i-^), and, even worse, the absence 
of an analytic expression for the likelihood function. 

In this paper we assvme that the pixelized map of galaxy 
counts is a Gaussian random field — an assumption which 
provides us with the analytic expression for the likeli- 
hood function. For models with Gaussian initial conditions 
(which are the only models we consider here), we expect 
this to be a good approximation on sufficiently large scales. 
Since we restrict ourselves to studying large-scale fluctua- 
tions, we can use large pix(^ls. thereby reducing A'pix and 
ensuring that the likelihood analysis is tractable. We also 
check the Gaussianity assumption with histograms of the 
pixel distribution. On the large scales of interest here and 
for a given three-dimensional length scale, Gaussianity is 
a better approximation for a galaxy count survey than for 
a redshift survey, due in part to the redshift-space dis- 
tortions which affect the latter (Hivon et al. 1995). The 
projection from 3 dimensions to 2 dimensions also tends 
to decrease non-Gaussianity. 

Where likelihood analysis is possible, it naturally han- 
dles the problems of other estimators (such as edge effects). 
Likelihood analysis also provides a convenient framework 
for taking into account various sources of systematic er- 
ror, such as spatially varying reddening and the "integral 
constraint" discussed in §6. 

To begin our likelihood analysis, we assume that the 
data are simply the angular position of each galaxy 
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observed — though it is possible to generahzc the follow- 
ing analysis and use either magnitude information or color 
redshifts. We pixelize the sky and count the number of 
galaxies in each pixel Gj. Then we calculate the fractional 
deviation of that number from the ensemble average: 



A,: = 



(1) 



where G is the ensemble average number of galaxies per 
unit solid angle and fi; is the pixel solid angle. We do 
not actually know the ensemble mean. In practice, we 
approximate it with the survey average G. We discuss this 
approximation in §6 and demonstrate that it has negligible 
impact on our results. 

We model the fractional deviation in each pixel from 
the mean as having a contribution from "signal" and from 
"noise" , so that 

Ai = Si + ni. (2) 

The covariancc matrix, Gy , for the fractional deviation in 
each pixel from the mean is given by: 



Qj = {A, A,) = S,, + N,, 



(3) 



where Sij = (siSj) and iVy = {'^if^j) a-r^^ the signal and 
noise covariance matrices. Roughly speaking, signal is part 
of the data that is duo to mass fluctuations along the linc- 
of-sight (see the appendix), and noise are those fluctua- 
tions due to anything else. 

The signal covariance matrix Sij depends on the param- 
eters of interest (the angular power spectrum Ci) via: 
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Sij = w{0ij) = ^ CiPi {cos 9ij)e 



(4) 



where Oij is the angular distance between pixels i and j and 
we have assumed a Gaussian smoothing of the pixclizcd 
galaxy map with fwhm = \/8lri2ah. In practice, we do 
not estimate each Ci individually but binned G;s with bin 
widths greater than ^ tt/9 where is a typical angular 
dimension for the survey. 

The noise contribution to the fluctuations n is due to 
the fact that two regions of space with the same mass 
density can have different number of galaxies. We model 
this additional source of fluctuatioiis as a Gaussian random 
process with variance equal to 1/G, so that 

Nij = (mnj) = 1/G5ij. (5) 

More sophisticated modeling of the noise is not necessary 
because at all I values of interest, the variance in G; due 
to the noise is much smaller than the sample variance. 

To find the maximum of the likelihood function, we it- 
eratively apply the following equation: 

1 



C){C-^dC/dCvC-^)], (6) 
where F is the Fisher matrix given by 



SCi = ^F^J,'Tv[{AA^ 



f„, = ;tv(g-i??g 



dC. 



.1 dC 



(7) 



and for later convenience we are using C; = l{l -\- 1)Ci/{2-k) 
instead of Ci. That is, start with an initial guess of C;, 
update this to Ci + 5Ci, and repeat. We have found that 



this iterative procedure converges to well within the size 
of the error bars quite rapidly. 

The small sky coverage prevents us from determining 
each multipole moment individually; thus we determine 
the power spectrum in bands of I instead, call them "band 
powers", and denote them by Cb where 



Ci = — = 2^Xb{1)Cb 



(8) 



and Xb(i) is unity for 1<{B) < I < 1>{B) where 1<{B) and 
1>{B) delimit band B. 

Although we view equation (6) as a means of finding 
the maximum of the likelihood function, one can also treat 
Ci + 6Ci (with no iteration) as an estimator in its own right 
(Tegmark 1997; Bond, Jaffe & Knox 1998). It is referred 
to as a quadratic estimator since it is a quadratic function 
of the data. One can view equation (6) as a weighted sum 
over (AA^ — G), with the weights chosen to optimally 
change Ci so that G is closer to AA^ in an average sense. 

Various sources of systematic error can be taken into 
account by including extra terms in the modeling of the 
data (equation (2)) and working out the effect on the data 
covariance matrix, G. Below we see specific examples as 
we take into account the "integral constraint" and pixel 
masking. The reader may also wish to sec the appendix 
of Bond, Jaffe & Knox (1998), Tegmark et al. 1998 and 
Knox et al. (1998) for more general discussions. 

3. CALCULATION OF C; 

We need to be able to calculate Ci for a given theory, in 
order to compare it with G; estimated from the data. This 
calculation is a three-step process. Step one is to calculate 
the matter power spectrum P{k) in linear perturbation 
theory. Step two is to then use some biasing prescription 
to convert this to the galaxy number count power spectrum 
Paik). Step throe is to project this P{k) to G;. We further 
discuss these steps in the following subsections. 

3.1. The 3D matter power spectrum, P{k) 

We take the primordial matter power spectrum to be a 
power-law with power-spectral index n and amplitude Sjj 
at the Hubble radius. We write the matter power spectrum 
today Pf){k) (calculated using linear perturbation theory) 
as a product of the primordial spectrum and a transfer 
function T{k): 

k^P^ik) 



Voik) 



2w^ 



S%{k/Hof+"T\k) (9) 



where Ho = lOO/ikmsec"^ Mpc~^ is the Hubble param- 
eter today. The transfer function, T{k), goes to unity 
at large scales since causality prevents microphysical pro- 
cesses from altering the spectrum at large scales. At higher 
k it depends on h, ftmh, and ^Ibh'^- To calculate the trans- 
fer function wo use the semi-analytic approximation of 
Eisenstein & Hii (2000). It is also available as an out- 
put from the publically available CMBfast Boltzmann code 
(Seljak & Zaldarriaga 1996). 

Our power spectrum is now parametrized by 5 param- 
eters: n, Sh, ^mh, h and flbh^. In the following analy- 
sis, we eliminate two of these parameters by simply fixing 
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h = 0.7 and flbh'^ = 0.019. The dependence of our results 
on variations in h can be derived analytically, which we 
do in §7. Measurements of deuterium abundances in the 
Ly-a forest, combined with the dependence of primordial 
abundances on the baryon density, lead to the constraint 
nth'^ = 0.019 ± 0.002 at 95% confidence (Buries & Tytler 
1998; Buries, Nollett & Turner 2000). 

Of the remaining parameters, two more, 6h and n, can 
be fixed by insisting on agreement with both the amplitude 
of CMB anisotropy on large angular scales as measured by 
COBE/DMR, and the number density of massive clusters 
at low redshifts. The COBE constraint can be expressed 
with the fitting formula 



(jg constraints forces n to change as well. For U,m = 
0.15, 0.3, 0.35, 0.4, 1, n 1.55, 1.00, 0.91, 0.84, 0.47 respec- 
tively. One can understand this by considering the simpler 
case of 5h and erg held constant without and n depen- 
dence. Then the only effect of changing Vlmh is to change 
the transfer function. For fixed increasing f2m/i in 
this case leads to increased power on small scales. One 
therefore needs to decrease the tilt in order to keep erg un- 
changed. Now, the fact that our two amplitude constraints 
do depend on also has an effect on how n changes with 
changing However, this is a subdominant effect be- 
cause these dependences are quite similar. 



5h = 1.94 X lO^^a 



-0.785-0.05 lnJ2„ 



exp 



-0.95 (n- 1) -0.170 {n - ly 



(10) 



which is valid for the flat ACDM models we are considering 
(Bunn & White 1995). 

The cluster abundance constraint can be expressed as 
a constraint on erg which is the rms fluctuation of mass 
in spheres of radius r ~ 8/i^^Mpc, calculated in linear 
theory: 

^1= I '^{in{kr)/{kr) fVo{k) (11) 



where ji{x) — {xcos{x) — sin(a;)) /x^ . VL99 find the most 
Hkely value of erg to be erg — 0.56 Sl^" '*''. 

The reason for the choice of the scale of 8/i^^Mpc is 
that a sphere of this size has a mass of about 10^^ Mq, 
which is the mass of a large galaxy cluster. Most of the Vim 
dependence of erg comes from the fact that the pre-coUapse 
length scale corresponding to a given mass depends on the 
matter density. Thus, in a low density Universe the pre- 
collapse scale is larger, and since there is less fluctuation 
power on larger scales, the erg normalization has to be 
higher for fixed cluster abundance. 

The shift in pre-coUapse length scale with changing 

1/3 

is very slow, scaling as Vtm . Thus, although the param- 
eters that govern the shape of the power spectrum affect 
the normalization, their influence is quite small. For ex- 
ample, the scale shift for changing Vim by a factor of 3 is 
3^'''^ = 1.44, and over this range an uncertainty in n of 0.2 
translates into an uncertainty in power of 8%. 

Of course there are uncertainties in both the constraint 
from COBE and the constraint from cluster abundances. 
More significant of the two is the uncertainty in clus- 
ter abundance constraint. Consequently, we extend our 
grid of models to cover a range of values of erg where 
cTg = a^Vl~^-^'^ . VL99 find that the probability of er| is log- 
normally distributed with a maximum at ct| — 0.56 and 

a variance of In cr| of 0.251n2(l -|- 0.2017^^^^'°*^'°"'"). The 
COBE uncertainty is only 7%. We ignore this source of 
uncertainty and do not expect it to affect our results since 
such a small departure from the nominal large-scale nor- 
malization can be easily mimicked, over the range of scales 
probed by EDSGC, by a very small change in the tilt n. 

In Fig. 1 we plot Vo{k) (dashed lines) for several 
models that satisfy the COBE/DMR and VL99 con- 
straints. Changing Vtmh and also satisfying the Sh and 




Fig. 1. — Matter power spectra and C; derivatives. From bottom 
to top at low I are the COBE and cluster consistent predictions for 
flm = 0.15,0.3,1, all with 6 = 1 (dashed lines show linear theory 
predictions). The I = 20 and I = 80 curves show kdCi/dVk for for 
these two multipole moments (arbitrary normalization). 



3.2. The biasing prescription 

Although biasing in general is stochastic, non-linear, 
redshift and scale dependent, we adopt the simplest pos- 
sible model here in which the galaxy number density fluc- 
tuations are directly proportional to the matter density 
fluctuations. Then we can write b = Sg/S, where 6 = Sp/p 
is the matter density contrast, is the galaxy number 
density contrast, and b is the bias factor. 

With this description Paik) — b^P{k), where P{k) is 
the matter power spectrum. Note that above we have only 
calculated the linear theory matter power spectrum. Non- 
linear corrections are important over the EDSGC range of 
length scales, and we must incorporate these effects. We 
derive P{k) from the linear theory power spectra Po(fc) 
by use of a fitting formula (Peacock & Dodds 1996) which 
provides a good fit to the results of n-body calculations. 
The resulting power spectra are shown by the solid lines 
in Fig. 1. 

We have assumed that the galaxy number density fluctu- 
ations are completely determined by the local density con- 
trast. The number density of galaxies must also have some 
non-local dependence on the density contrast. More com- 
plicated modeling of the relationship, or "biasing schemes" 
(e.g., Cen & Ostriker 1992; Mann, Peacock & Heavens 
1998, Dekel & Lahav 1999) are beyond the scope of this 
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paper. In the applications that foUow wc assume the bias 
to be independent of time or scale, although our formalism 
allows inclusion of both of these possibilities. 

Prom analytic theory (e.g., Seljak 2000) we expect the 
bias to be scale independent on scales that are larger than 
any collapsed dark matter halos. Numerical simulations 
show this to be the case as well (see Blanton et al. 2000; 
Nayayanan, Berlind & Weinberg 2000) on scales larger 
than 10/z~^Mpc. Moreover, recent observations by Miller, 
Nichol & Batuski (2001) show that a scale- independent, 
linear, biasing model works well when scaling cluster & 
galaxy data over the range of 200 to 40/i~^ Mpc. Our 
results are determined mostly by information from these 
large scales. Since we find acceptable fits to the data us- 
ing our constant bias model, we have no evidence for a 
scale-dependent bias. 

3.3. The projection to 2D 

As described in the appendix, C; can be calculated from 
Vo{k) and the selection function as 

Ci=4tt Jvo{k)fi{kfdk/k (12) 

where 

1 f dr 

m =-^j dz-ji{kryg{z)D{z)bT^,{k, z), (13) 

r is the comoving distance along our past light cone, g(z) 
is the mean comoving number density of observable galax- 
ies, D{z) is the growth of perturbations in linear theory 
relative to ^ = and T„i(fc,.z) is the correction factor for 
non-linear evolution (Peacock & Dodds 1996). 

Equations (12) and (13) are valid for all angular scales. 
It becomes time-consuming to evaluate the Bessel func- 
tion on smaller angular scales. Although we always used 
equations (12) and (13), the reader should know that there 
is a much more rapid approximation which works well at 
I > 30: 



C, 



1 



dz^Pik = l/r, z)[g{z)D{z)bT^,{k = l/r, z)f . 



(14) 

In order to calculate C;, we need to know g{z). Since 
r'^g{z)dr/dz = dG/dz (Baugh & Efstathiou 1993; 1994; 
our appendix) it is sufficient to know dG/dz, whose mea- 
surement is described in §5. 

To give an idea of how Ci depends on V{k) we plot 
kdCi/dVik) in Fig. 1 for / = 20 and I = 80. This quantity 
is the contribution to C; from each logarithmic interval 
in k. Note that it is the breadth of these derivatives that 
explains the correlations that appear in any attempt to re- 
construct P{k) from angular correlation data. The deriva- 
tives have some dependence on cosmology; those plotted 
are for the il^ = 0.3 case. 

The angular power spectrum is sensitive not only to 
the power spectrum today, but to the power spectrum in 
the past as well. In linear theory, the evolution of the 
power spectrum is separable in k and z\ one can write 
P{k,z) ~ P(fc,0)D^(z) where D(z) is the growth factor 
well-described by the fitting formula of Carroll, Press & 
Turner (1992). We also assume that this relation holds for 
the non-linear power spectra. In truth, non-linear evolu- 
tion is more rapid at higher k than at lower k. We expect 



our approximation to therefore be overestimates of C; , but 
since wc do not use data that reach very far into the non- 
linear regime, we do not expect this error to be significant. 

4. EXTRACTION OF PARAMETERS 

To find the maximum-likelihood power spectrum, we 
have iteratively applied the binned version of equation (6). 
Although equation (6) is used as an iterative means of find- 
ing the maximum of the likelihood, it is also convenient to 
write it as the equivalent equation for Cb, instead of the 
correction 6Cb- 



Cb = ^ E ^Bh'Tr \{AA^ - N) C-'-^C 



dCr 



, (15) 



where the right-hand side is evaluated at the previous it- 
eration value of Cb, Cf"^, and Cb = Cf"^ + SCb is the 
updated power spectrum. 

We have shown how to calculate Ci from the theoretical 
parameters. We now need to calculate what Cb expect 
for this C;. One can show that the expectation value for 
Cb, given that the data are realized from a power spectrum 
Ci, is 



I'eB' 



(16) 



where the Fisher matrices on the right-hand side are eval- 
uated at C§^^, and the last line serves to define the band- 
power window function . Note that the sum over I' 
is only from 1^{B') to ly{B'). This equation reduces to 
equation (8) of Knox (1999) in the limit of diagonal Fbb' ■ 
It is this expectation value that should be compared to the 
measured Cb- 

As shown by Bond, Jaffc & Knox (2000), the probability 
distribution of C; is well-approximated by an offset log- 
normal form. In the sample-variance limit, which applies 
for our analysis of EDSGC, this reduces to a log-normal 
distribution. Therefore we take the uncertainty in each Cb 
to be log-normally distributed and evaluate the following 



XEDSGc(^m,&,0-g) = (17) 

E (inCs - InClj) CbFbb'Cb' (in Cb- - InC^,) 



BB' 



C], = Y,^Ci{n^,b,al) 



(18) 



where cs = ct| fi^" '*''. 

Our total = xIdsgc + Xvl includes the contribution 
from the cluster abundance constraint which is also log- 
normal: 

X^L = (ln«T|-ln0.56)V^' (19) 

where <j=\\u{l + 0.32 0^^^'°^!°"'") (Viana and Liddle 

1996, hereafter VL96). Note that here and throughout we 
have adopted the more conservative uncertainty in VL96, 
as opposed to the VL99 uncertainty. 
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5. APPLICATION TO THE EDSGC 

The Edinburgh/Durham Southern Galaxy Catalogue 
(EDSGC) is a sample of nearly 1.5 million galaxies cover- 
ing over 1000 deg^ centered on the South Galactic Pole. 
The reader is referred to Nichol, Collins & Lumsden (2000) 
for a full description of the construction of this galaxy cat- 
alogue as well as a review of the science derived from this 
survey. For the EDSGC data, the reader is referred to 
www . edsgc . org . 

For the analysis discussed in this paper, we consider only 
the contiguous region of the EDSGC defined in Nichol, 
Collins & Lumsden (2000) and Collins, Nichol & Lums- 
den (1992) (right ascensions 23 < a < 3 hours, through 
zero hours, and declinations —42° < S < —23°). We also 
restrict the analysis to the magnitude range 10 < 6j < 
19.4. The faint end of this range is nearly one magni- 
tude brighter than the completeness limit of the EDSGC 
(see Nichol, Collins & Lumsden 2000) but corresponds to 
the limiting magnitude of the ESQ Slice Project (ESP) of 
Vettolani et al. (1998) which was originally based on the 
EDSGC. The ESP survey is 85% complete to this limit- 
ing magnitude {bj = 19.4) and consists of 3342 galaxies 
with redshift determination. This allows us to compute 
the selection function of the whole EDSGC survey which 
is shown in Fig. 2. The data shown in this figure has been 
corrected for the 15% incompleteness in galaxies brighter 
than bj = 19.4 with no measured redshifts as well as the 
mean stellar contamination of 12% found by Zucca et al. 
(1997) in the EDSGC. These corrections are not strong 
functions of magnitude; therefore, we apply them as con- 
stant values across the whole magnitude range of the sur- 
vey. 

As mentioned above, we need to correct our power spec- 
trum estimates for stellar contamination in the EDSGC 
map. If the stars are uncorrelated (which we assume) then 
their presence will suppress the fluctuation power as we 
now explain. Let Ti be the total count in pixel i, consist- 
ing of galaxies and stars: Ti = d + Si (for simplicity, we 
consider equal-area pixels). Let a — 0.12 be the fraction 
of the total that are stars, so that G = (1 — a)T. Then, 
defining Af = {d - G) /G and Af = {Si - S) /S, we have 



= (l-a)Af + aAf. (21) 

Af is what we are after: density contrast in the absence 
of stellar contamination. The second term amounts to a 
small additional source of noise. Since, as mentioned in 
§2, the noise is completely unimportant on the scales of 
interest, we neglect this term. Therefore, 

(AfAf) = (l-a)-2(AA)- (22) 
We have accordingly corrected all our Cb estimates and 
their error bars upwards by (1 — a)^^ ~ 1.29. 

By selection function we mean dG/dz where G is the 
mean number of EDSGC galaxies per steradian. The 
smooth curve in Fig. 2 was chosen to fit the histogram, 
and is given by 

^ = 4 X 10'^ exp(-(z/0.06)3/2) (^)' . (23) 

Restricting ourselves to bj < 19.4 leaves around 200,000 
galaxies. Although this is only ~ 15% of the total number 
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Fig. 2. — The selection function for the EDSGC; i.e., the mean 
number of galaxies per steradian per redshift interval. 



of galaxies in the EDSGC, the resulting shot noise is still 
less than the fiuctuation power, even at the smallest scales 
we consider. 

We binned the map into 5700 pixels with extent 0.5° 
in declination and 0.5° in right ascension (RA). The pix- 
els are slightly rectangular with varying solid angles; the 
RA widths correspond to angular distances ranging from 
0.46° at (5 = -23° to 0.37° a.t 6 = -42°. This pixehza- 
tion is fine enough so as not to affect our interpretation of 
the large-scale fluctuations; it causes a ~4% suppression 
of the fluctuation power at / = 80. We have varied the 
pixelization scale to test this and find that with 1° x 1° 
pixels the estimated CiS change by less than half an error 
bar for I < 80. 

We also took into account the "drill holes" , locations in 
the map which were obstructed (e.g. by bright stars). In 
the case of 0.5° x 0.5° pixelization about 75 pixels were 
corrupted by drill holes. Those pixels were assigned large 
diagonal values in the noise matrix (e.g., Bond, Jaffe & 
Knox 1998), and thus had negligible weight in the subse- 
quent analysis. The 0.5° x 0.5° pixelized map is shown in 
Fig. 3. 

In Fig. 4 we plot the estimated angular power spec- 
trum from the EDSGC data. Also shown in Fig. 4 are 
our predicted CiS. For each of these, we can calculate the 
expected values of Cb by summing over the window func- 
tions, shown in the bottom panel for the six lowest / bands. 
The jaggedness results from our practice of calculating the 
Fisher matrix not for every but for fine bins of I labeled 
by b. We then assume Fw = Fw / {Sl{b)Sl{b')). 

We apply equation (17) with the sum restricted to the 
six at lowest I. First we keep ct| fixed to the preferred 
value of 0.56 (VL99) resulting in a whose contours are 
shown as the dashed lines in Fig. 5. The minimum of this 
is 8.1 for 6 — 2 = 4 degrees of freedom at flm = 0.35 
and b = 1.3, where n = 0.91. This is an acceptable x^; the 
probability of a larger is 9%- Moving towards higher 
flm decreases the VL99 preferred value of erg , and thus the 
preferred value of b increases. Increasing n„i also changes 
the transfer function, requiring a decrease in n in order 



HUTERER ET AL. 



7 




22 23 1 2 



Right Ascension (hrs) 

Fig. 3. — Map of the EDSGC that we used in our analysis (23 < ra < 3 hours and —42 < dec < —23 hours, bj < 19.4). Five of the largest 
masks are indicated with squares. 
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Fig. 4. — Angular power spectra estimated from the data and 
predicted for various models. From bottom to top at low I are the 
COBE and cluster consistent predictions for Clm = 0.15,0.3, 1 and 
b = 1 (dashed lines show linear theory predictions) . The lower panel 
shows the window functions for the first six bands. 



to agree with both COBE/DMR and cluster abundances. 
This change in the shape of the angular power spectrum 
leads to an increase in Xedsgc- Moving towards lower flm 
generates a bluer tilt to the Ci shape in two different ways. 
It leads to higher n for consistency with COBE/DMR and 
cluster abundances and it also increases the importance of 
non-linear corrections. These combined effects lead to a 
rapidly increasing Xedsgc ^'^^ ^ 

The uncertainties on ct| from cluster abundances (as we 
interpret them) are significantly larger than the EDSGC 
constraints on b for fixed Ug. If we take them into account, 
we must include additional prior information in order to 
obtain an interesting constraint on the bias. Since (at 
fixed flrn) changing changes n, prior constraints on n 
will help to constrain a|. Therefore we work with the 
total = xIdsgc +XvL+xl- From a combined anal- 
ysis of Boomerang-98, Maxima-l and COBE/DMR data, 
JafFe et al. (2000) find n = 1 ± 0.1; hence we adopt 
= ("n. — 1)^/0.1^. We marginalize the likelihood, which 
is proportional to over a^. 

Marginalizing over the amplitude constraint from clus- 
ter abundances, we find 1.07 < b < 2.33 at the best-fit 
value of ^Im = 0.35, and 1.11 < b < 2.35 after marginal- 
izing over iljn (both ranges 95% confidence). These con- 
straints correspond to the solid and dashed contours re- 
spectively in Fig. 5. Figure 6 shows the likelihood of 
bias, when marginalized either over ag, (solid line) or flm 
(dashed line). Marginalizing over the bias leads to weak 
constraints on unless one insists on allowing only 

small departures from scale-invariance. With the assump- 
tion that the primordial power spectral index is n = 1±0.1, 
we find 0.2 < V,m < 0.55 at 95% confidence. Furthermore, 
it is interesting that not only do "concordance" -type mod- 
els, with scale-independent biases provide the best fits to 
the EDSGC data, but they also provide acceptable fits. 

6. SYSTEMATIC ERRORS 

In this section we discuss three sources of systematic er- 
ror: spatially-varying extinction by interstellar dust, de- 
viation of the survey mean from the ensemble mean and 
deviation from Gaussianity. Above we have assumed their 
impact on the data to be negligible. In the following we 
use maps with three different pixelizations: BIGPIX (1.5 
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Fig. 5. — Contours of constant in the flm vs. bias plane. The 
dashed lines are for erg chosen to be at Viana and Liddle maximum- 
likelihood value. The solid line is the result of marginalizing over 
ITS, with the VL99 prior and a prior in n of 1 ± 0.1. The contour 
levels show the minimum as well as 2.3 and 6.17 above the minimum, 
corresponding to 68% and 95.4% confidence levels if the distribution 
were Gaussian. 



dog X 1.5 dog pixels, a total of iV = 650 of them), MED- 
PIX (1.0 dog X 1.0 deg, N = 1425) and FINEPIX (0.5 
deg X 0.5 deg, N = 5700). Note that FINEPIX was ul- 
timately used to obtain the cosmological parameter con- 
straints. Coarser pixclizations, however, arc easier to work 
with due to a much smaller number of pixels (in particu- 
lar, x matrices have to be repeatedly inverted in the 
quadratic estimator). 

6.1. Interstellar Dust 

The first possible source of systematic error, interstel- 
lar dust, we can dispense with quickly due to the work 
of Nichol & Collins (1993) and, more recently, Efstathiou 
& Moody (2000). The former investigated the effects of 
interstellar dust (using HI and IRAS maps as tracers of 
the dust) on the observed angular correlation function 
of EDSGC galaxies (see Collins et al. 1992) and found 
no significant effect on the angular correlations of these 
galaxies to bj = 19.5. We note that Nichol & Collins 
(1993) also investigated plate-to-plate photometric errors 
and concluded they were also unlikely to severely effect 
the angular correlations of EDSGC galaxies. Efstathiou 
& Moody (2000) used the latest dust maps from Schlegel, 
Finkbeiner & Davis (1998) to make extinction corrections 
to the APM catalog and found that for galactic latitudes 
of l&l > 20°, the corrections have no significant impact on 
the angular power spectrum. Since all the EDSGC sur- 
vey area resides at galactic latitudes of |&| > 20° and has 
been thoroughly checked for extinction induced correla- 
tions, we conclude that spatially- varying dust extinction 
has not significantly affected our power-spectrum deter- 
minations either. 



6.2. Integral Constraint 




1.0 1.5 2.0 2.5 3.0 0.20 0.30 0.40 0.50 0.60 0.70 

b 0^ 

Fig. 6. — Left panel: likelihood of b marginalized over erg (with 
n = 1 ± 0.1 prior) at Clm = 0.35 (solid line), and additionally 
marginalized over Qrn (dashed line). Right panel: likelihood of Qm 
with no priors (dotted line), n prior (dashed line), n and our VL 
priors (solid line). 



We are interested in the statistical properties of devi- 
ations from the mean surface density of galaxies. This 
effort is complicated by our uncertain knowledge of the 
mean. Our best estimate of the ensemble mean is the sur- 
vey mean. But assuming that the survey mean is equal 
to the ensemble mean leads to artificially suppressed esti- 
mates of the fluctuation power on the largest scales of the 
survey. This assumption is often referred to as "neglecting 
the integral constraint" (for discussions, see, e.g.. Peacock 
& Nicholson (1991); Collins, Nichol & Lumsden (1992)). 

Let G be the ensemble average number of galaxies in a 
pixel. Let us denote the survey average as 

G=^^G,. (24) 

Since we do not know the ensemble average, in practice we 
use the survey average to create the contrast map: 



A Gi G 1 / A \ 

A,: = — = {Ai - e) 



where 



G 



1 + e 

Gj-G 
G 



(25) 



(26) 



is the contrast map made with the ensemble average and 



G-G 
G 



(27) 
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is the fractional difference between the two averages (for 
simphcity of notation we are assuming equal-area pixels). 

Our likelihood function should not have the covariance 
matrix for A^, but instead for A,;. These are related by 

(A,A,) = (A,A,) - (e (A, + A,)) + (e^) (28) 

plus higher order terms^. The extra terms of the above 
equation are easily calculated with the following expres- 
sions: 



pix ij 



(29) 



Each correction term typically contributes 10-20% to 
the corresponding terms of the covariance matrix (they 
do not cancel, since there are two linear correction terms; 
see equation (28)). The main contribution comes from 
the lowest multipoles, corresponding to largest angles 6. 
Indeed, the correction terms come almost entirely from 
our lowest multipole bin. Dropping this bin (or using a 
ACDM Ci) reduces the correction terms to 2% or less. 

The amplitude of the correction terms can be under- 
stood from the weakness of the signal correlations on scales 
approaching the smaller survey dimension of 19°. In that 
case, we can write: 



1 



pix 



s{9)ede/n. 



(30) 



where is the area of the survey, and S{6) is the signal 
covariance, given by the RHS of equation (4) (we have 
neglected pixel noise). We plot the integrand in Fig. 7 in 
units of 5(0). 

Fortunately, even though the correction terms are not 
entirely negligible, their inclusion makes the estimated Ci 
change very little. This is shown in Fig. 8. The most 
significant change is a '--^ 20% broadening of the error bar of 
the lowest multipole. Including this effect has a negligible 
consequence on our cosmological parameter constraints. 

6.3. Gaussianity 

On large enough scales, we expect the maps to be 
Gaussian-distributed. Figure 9 shows histograms of the 
data for the three pixelizations we examined. The his- 
tograms are overplotted with the Gaussians with zero 
mean and variance equal to the pixel variance. One can 
see the improved consistency with Gaussianity as the pixel 
size increases. 

We applied a Kolmogorov-Smirnov test (e.g.. Press et 
al. 1992) to check for consistency of the above histograms 
with their corresponding zero-mean Gaussians. We find 
probabilities that these Gaussians are the parent distribu- 
tions of < 10-1°, 0.001, and 4.5% for FINEPIX, MEDPIX 
and BIGPIX respectively, indicating that Gaussianity is a 
better approximation on large scales than it is on small 
scales, as expected. We also determined the skewness of 
the maps in units of the variance to the 1.5 power, and 

2 

An exact expression to all orders is given by equation (20) of 
Gaztanaga & Hui (1999). 
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Fig. 7. — The area under the curve is approximately equal to the 
integral constraint correction terms of equation (28) in units of S(0); 
see equation (30). The assumed model is =0.3 with VL99 and 
COBE/DMK normalization. (The oscillations are due to the fact 
that only the contributions from multipole moments at I < 180 were 
included.) 
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Fig. 8. — The C; determined with and without the integral con- 
straint correction. Shown is the MEDPIX case, and abscissae of 
points were slightly offset for easier viewing. 
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Fig. 9. — Histograms of the data, ovcrplottcd with Gaussians 
centered at zero with variances equal to the pixel variances, for 
maps made with three different pixel sizes. From top to bottom 
they are: BIGPIX, MEDPIX, FINEPIX. 



find the same trend of decreasing non-Gaussianity with 
scale: 1.21, 0.85, 0.79. 

The trend with increasing angular scale and the weak- 
ness of the ^ 2(T discrepancy for the BIGPIX map are 
reassuring for our analysis that considered only moments 
I < 80. Note that a spherical harmonic with / = 80 has 
3 BIGPIX pixels in a wavelength. However, a normalized 
skewness near unity is worrisome — and this skcwness is 
not decreasing rapidly with increasing angular scale. We 
discuss possible ways of dealing with this non-Gaussianity 
in the next section. 

7. DISCUSSION 

We reduced our sensitivity to the non-Gaussianity of the 
data by restricting our cosmological parameter analysis to 
/ < 80. However, the map may still be significantly non- 
Gaussian even on these large scales. Future analyses of 
more powerful data sets that result in smaller statistical 
errors will have to quantify the effects of the Gaussianity 
assumption, which we have not done here. 

The non Gaussianity may force us towards a Monte 
Carlo approach. An analysis procedure similar to the one 
utilized here may have to be repeated many times on sim- 
ulated data — where the simulations include the non-linear 
evolution that presumably is the source of the Gaussianity. 
The distribution of the recovered parameters can then be 
used to correct biases and characterize uncertainties. 

Monte-Carlo approaches may be necessary for other rea- 
sons as well. Recently Szapudi et al. (2000) have tested 
a quadratic estimator for Ci with a simpler (sub-optimal) 
weighting scheme that only requires on the order of A'' 
operations (or N^/N operations using the new algorithms 
of Moore et al. 2001) instead of A''^. A drawback is that 
evaluation of analytic expressions for the uncertainties re- 
quires on the order of A^'' operations. Fortunately, the 
estimation of Ci is rapid enough to permit a Monte-Carlo 
determination of the uncertainties in a reasonable amount 
of time. 



Note though that Bayesian approaches may still be vi- 
able, if it can bo shown that non Gaussian analytic; expres- 
sions for the likelihood provide an adequate description of 
the statistical properties of the data. See Magueijo, Hob- 
son & Lascnby (2000) and Contaldi et al. (2000). 

To get our constraints on cosmological parameters we 
fixed the Hubble constant at 70 km/sec/Mpc, or ft, = 0.7. 
We now explain how our bias results and results scale 
for different values of the Hubble constant. 

The transfer function depends on the size of the horizon 
at matter-radiation equality Aeq which is proportional to 
l/(rim/i^), or, in convenient distance units of /i~-^Mpc, 
l/{Qmh). The latter quantity is the relevant one since 
all distances come from rcdshifts and the application of 
Bubble's law (in this case the redshifts taken for our se- 
lection function) with the result that distances are only 
known in units of h~^Mpc. Thus there is a degeneracy 
between models with the same value of flmh and different 
values of h. 

This degeneracy is broken by the Om dependence of the 

COBE-normalization of Sh and the cluster normalization 
of as- Increasing h at fixed values of i^mh means flm 
decreases, raising both Sh and erg. Ignoring non— linear 
effects, this can be mimicked by an increase in the bias 
and only a very slight reddening of the tilt (since Sh has 
risen only slightly more than erg and there is a long baseline 
to exploit). 

The end result is that our constraints on b are actually 

constraints on 6(ft/0.7)~°'^, and our constraints on il„i (at 
least when marginalized over bias) are actually constraints 
on f2„(/i/0.7). 

8. CONCLUSIONS 

We have presented a general formalism to analyze 
galaxy surveys without redshift information. We pixelize 
the galaxy counts on the sky, and then, using the quadratic 
estimator algorithm, extract the angular power spectrum - 
a procedure already in use in CMB data analysis. Just like 
in the CMB case, one effectively converts complex informa- 
tion contained in the experiment (in this case, locations of 
several hundred thousand galaxies) into a handful of num- 
bers - the angular power spectrum. One can then use the 
angular power spectrum for all subsequent analyses. 

We apply this method to the EDSGC survey. We com- 
pute the angular power spectrum of EDSGC, and com- 
bine it with COBE/DMR and cluster constraints to obtain 
constraints on cosmological parameters. Assuming fiat 
ACDM models with constant bias between galaxies and 
dark matter, we get 1.11 < 6 < 2.35 and 0.2 < < 0.55 
at 95% confidence. 

One advantage of our formalism is that it does not re- 
quire galaxy redshifts, but only their positions in the sky. 
This should make it useful for surveys with very large num- 
ber of galaxies, only a fraction of which will have redshift 
information. For example, the ongoing SDSS is expected 
to collect about one million galaxies with redshift infor- 
mation, but also a staggering one hundred million galaxies 
with photometric information only. Using the techniques 
presented in this paper, one will be able to convert that 
information into the angular power spectrum, which can 
then be used for various further analyses. 
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APPENDIX A 

LIMBER'S EQUATION 

In order to derive the equation giving Ci as a function 
of P{k) we must understand the dependence of the data 

on the 3D matter density contrast 5 = 5p/ p as a, function 
of time and space. First, we relate the number of galaxies 
per unit solid angle G observed from location r in a beam 
with FWHM = VS In 2(7 centered on the direction 7 to 
the comoving number density of detectable galaxies <?, via 

^—5 gir',To-x), (Al) 

where x = r' — r, x is the magnitude of x and tq is the 
conformal distance to the horizon today. To relate g to 6 
we simply assume that the galaxies are a biased tracer of 
the mass, so that g = g{l + bS). Therefore: 

A / .s \ G G 

A(r,7) = 



d^r' g{r',To-x), 



G 

g-|x-7r/2<^' 

2^ 



gix)b{x)5{r',To-\x\), (A2) 



where wc have allowed for a time-dependent (and therefore 
x-depcndent) bias. If we further assume that the density 
contrast grows uniformly with time, with growth factor 
D{x), then we can write: 



A(r,7) 



1 

G 



|x-7|V2<t' 

' W 9{x)bix)S{r', To)Dix). 



(A3) 

Calculating ^(^12) = (A(r, 7i)A(r, 72)) and then tak- 
ing its Legendre transform yields (after a fair amount of 
algebra): 

Ci = ^j k^dkP{k)fi{kf (A4) 

where 

m ^^j -^^ji{kx)x^-g{x)D{x)h{x) (A5) 
and F{x) enters the metric via 

ds^ = [dr^ - dx^/Fix) + x^dO^ + x^ sin^ Odcf)'^] . (A6) 

For zero mean curvature F{x) = 1; expressions valid for 
general values of the curvature are given by Peebles (1980, 
equation (50.16)). 
Note that 



G- 



I 



F{rr 
dG 



'{r)dr 



dz 



dz 



and therefore 



-^^-g{r)dr/dz = 



(A7) 



(A8) 



One can use equations (A4) and (A5) to calculate the 

expected value of Ci for any theory. The only information 
one needs from the survey to do this is g{r) or d,G/dz. The 
latter is preferable, and what wc use in our application, 
because it is directly observable as long as redshifts in 
some region are available. 



